Estimating population diversity with CatchAll
Abstract
MOTIVATION The massive data produced by next-generation sequencing require advanced statistical tools. We address estimating the total diversity or species richness in a population. To date, only relatively simple methods have been implemented in available software. There is a need for software employing modern, computationally intensive statistical analyses including error, goodness-of-fit and robustness assessments.
RESULTS We present CatchAll, a fast, easy-to-use, platform-independent program that computes maximum likelihood estimates for finite-mixture models, weighted linear regression-based analyses and coverage-based non-parametric methods, along with outlier diagnostics. Given sample 'frequency count' data, CatchAll computes 12 different diversity estimates and applies a model-selection algorithm. CatchAll also derives discounted diversity estimates to adjust for possibly uncertain low-frequency counts. It is accompanied by an Excel-based graphics program.
AVAILABILITY Free executable downloads for Linux, Windows and Mac OS, with manual and source code, at www.northeastern.edu/catchall.
CONTACT [email protected].
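To make 'frequency count' data concrete, the sketch below computes two textbook non-parametric richness estimates (the Good-Turing coverage estimate and Chao1) from a mapping of frequency to number of taxa. It is a minimal illustration assuming that input shape; the function names are hypothetical and the code is not taken from CatchAll's source, nor does it reproduce its model selection or error assessments.

```python
def good_turing_richness(freq_counts):
    """Coverage-based estimate: observed taxa divided by estimated sample coverage.

    freq_counts maps a frequency j to the number of taxa seen exactly j times
    (an assumed input format for this illustration).
    """
    observed = sum(freq_counts.values())               # taxa seen at least once
    n = sum(j * f for j, f in freq_counts.items())     # total individuals sampled
    f1 = freq_counts.get(1, 0)                         # singletons
    if n == 0 or f1 == n:
        raise ValueError("coverage estimate is zero; richness is undefined")
    coverage = 1 - f1 / n                              # Good-Turing coverage
    return observed / coverage


def chao1_richness(freq_counts):
    """Chao1 lower-bound estimate from singleton and doubleton counts."""
    observed = sum(freq_counts.values())
    f1 = freq_counts.get(1, 0)
    f2 = freq_counts.get(2, 0)
    if f2 > 0:
        return observed + f1 * f1 / (2 * f2)
    # Bias-corrected form, used here when there are no doubletons.
    return observed + f1 * (f1 - 1) / 2


if __name__ == "__main__":
    # Made-up counts: 10 taxa seen once, 5 seen twice, 2 seen three times.
    counts = {1: 10, 2: 5, 3: 2}
    print(good_turing_richness(counts))
    print(chao1_richness(counts))
```

Both functions rely only on the low-frequency counts, which is why uncertain singletons and doubletons can shift such estimates noticeably.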
Related articles
Estimating the Number of Species with Catchall
In many situations we are faced with the need to estimate the number of classes in a population from observed count data: this arises not only in biology, where we are interested in the number of taxa such as species, but also in many other fields such as public health, criminal justice, software engineering, etc. This problem has a rich history in theoretical statistics, dating back at least t...
Models for estimating phytoplankton population densities under different environmental conditions with emphasis on climatic factors
The aim of this study is to determine the effect of environmental conditions with emphasis on the main meteorological factors (air temperature variables, sunshine hour, and humidity), on phytoplankton communities. As important primary producers in aquatic ecosystems, phytoplankton communities could be affected by several factors. Environmental factors play the major role in occurrence and diver...
Estimation of viral richness from shotgun metagenomes using a frequency count approach
BACKGROUND Viruses are important drivers of ecosystem functions, yet little is known about the vast majority of viruses. Viral shotgun metagenomics enables the investigation of broad ecological questions in phage communities. One ecological characteristic is species richness, which is the number of different species in a community. Viruses do not have a phylogenetic marker analogous to the bact...
Evaluating the performance of likelihood methods for detecting population structure and migration.
A plethora of statistical models have recently been developed to estimate components of population genetic history. Very few of these methods, however, have been adequately evaluated for their performance in accurately estimating population genetic parameters of interest. In this paper, we continue a research program of evaluation of population genetic methods through computer simulation. Speci...
Estimating the diversity of peptide populations from limited sequence data
MOTIVATION Combinatorial libraries of peptides such as those displayed on the surface of a bacteriophage particle have become widely used tools for characterizing protein-protein and protein-small molecule interactions. The quality of a library frequently depends on its completeness, or diversity-the proportion of possible sequences actually present in the library. The diversity of these librar...
Journal: Bioinformatics
Volume 28, Issue 7
Pages: -
Publication date: 2012